CDS

Accession Number TCMCG065C32480
gbkey CDS
Protein Id XP_004983436.1
Location join(33783129..33783227,33783326..33783440,33783561..33783640,33784143..33784218,33784307..33784371,33784485..33784547,33786318..33786399,33786498..33786660,33786954..33787019,33787104..33787770)
Gene LOC101754467
GeneID 101754467
Organism Setaria italica
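
The Location field above can be cross-checked against the protein length reported further down in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank feature-table convention) and that the final segment includes the stop codon. The helper name `parse_join` is illustrative and not part of any database tooling.

```python
import re

# Location string copied verbatim from the "Location" field of this record.
location = (
    "join(33783129..33783227,33783326..33783440,33783561..33783640,"
    "33784143..33784218,33784307..33784371,33784485..33784547,"
    "33786318..33786399,33786498..33786660,33786954..33787019,"
    "33787104..33787770)"
)

def parse_join(loc):
    """Return the (start, end) pairs of a join(...) location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", loc)]

segments = parse_join(location)

# Inclusive coordinates: a segment a..b spans b - a + 1 nucleotides.
segment_lengths = [end - start + 1 for start, end in segments]
cds_length = sum(segment_lengths)

print(len(segments))        # 10 coding segments
print(cds_length)           # 1476 nucleotides
print(cds_length // 3 - 1)  # 491 codons once the stop codon is excluded
```

Under these assumptions the ten segments sum to 1476 nt, that is 492 codons, which agrees with the 491 aa protein length plus one stop codon.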

Protein

Length 491 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA207554
db_source XM_004983379.3
Definition U1 small nuclear ribonucleoprotein 70 kDa [Setaria italica]

EGGNOG-MAPPER Annotation

COG_category A
Description U1 small nuclear ribonucleoprotein of 70kDa MW N terminal
KEGG_TC -
KEGG_Module M00351
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko03041
KEGG_ko ko:K11093
EC -
KEGG_Pathway ko03040, map03040
GOs -

Sequence

CDS:  
ATGGGCGACTACGGCCACGGCGGGGGCCAGGTGCGGGGCAACCCGGACTCCCGGCCCAGGGGCCAGGGGCAGCGCCCCAACGTCCAGCAGCTCAAGCTCATGGGGCAGATCCACCCGACGGGGCTCACGCCCAACCTGCTTAAGCTCTTTGAGCCGCGGCCGCCGCTCGAGTACAAGCCCCCGCTCGAGAAGCGCAAATTGCCGGCCTACACAGGGATGGCACACTTTGTGTCGCACTTTGCTGAGCCCGGGGATCCGGAATATGCTCCGCCCGTGCCCAAGTGTGAGACAAGGGCTGAAAAGAAGGCTAGGATTCGTGATAATAAGCTCGAGCAAGGTGCAGCTAAGGTTGCTGAAGAGCTTCAGAAGTATGACCCACAAAGTGACCCCAATGCCACTGGTGACCCATACAAAACGCTCTTTGTTGCAAGACTTAATTATGAGACGTCTGAGAACAAGATCAAACGGGAGTTTGAAGCTTATGGGCCTATTAAAAGGGTTCGGCTTGTAACTGAGAAGGATACAAGTAAACCGAGAGGATATGCTTTCATAGAGTACATGCACACACGGGACATGAAAAATGCCTACAAGCAGGCAGATGGGAGAAAAGTGGACAATAAAAGGGTATTAGTTGATGTTGAGCGTGGCAGAACTGTTCCGAATTGGCGTCCCAGGAGATTGGGTGGTGGACTGGGATCAAGCAGGATGGGTGGTGCAGAGACTGATAAAAAGGATTCTGCTAGGGAGCAGCAGCAGGGTGGGCGTCCCAGATCAGAAGAGCCTAGGAGGGATGATCGACGTGCTGATAGGGATCGGGAGAAGTCCCGTGAAAGGGTACGGGAAAGAGACCGTGATGAAAGAGCCCGTGAGCGTTCACATGACCGGACTCGTGATCGTGATTCACGAGAAGAGAAGCATCACCATAGAGACCGTGAGAGGACTAGGGACAGGGAGAGAGGAAAGGACCGGGAAAGAGAGCATGGTCGTGATCGTGATCGTCGTGACAGAGACAGGGACAGGGATCGCGGCCGTGACTATGAAAGAGAAACGGACCGGGCTCGCTCTCATGATCGCCATCGTGAGAGGGGCAGGGATCGTGGTGAAAGAGATTATGAGCGCACCAGTCACGAACGTGACCGTGGCCACAGGCACGAGAGGGATGCGGACTATGGCAATGGTGGGCCAAAGCATGACAAAAATCTGTCCAGTTACGGGCAGGATTATGGCTATGGTCAGTATGAGCAACACAAGGGTCATGAGGCATATGGTTATGGTCAAGATGGACGTGGGCATGAAACTGAGCACTCGAAGCGGCATGATCAGGAGTATTATCGTGTTGACTCGTACAGTAAAATGGAAACCAACTATCAGGTGCAGCCTAACAATGCTGAACCTGAAGGGCCTGAGGAAGGAGAGGCATATGAGGAAGGCGACTACCAATATCACCGAGCAGGTGAACACATGAATGATGCTTGA
Protein:  
MGDYGHGGGQVRGNPDSRPRGQGQRPNVQQLKLMGQIHPTGLTPNLLKLFEPRPPLEYKPPLEKRKLPAYTGMAHFVSHFAEPGDPEYAPPVPKCETRAEKKARIRDNKLEQGAAKVAEELQKYDPQSDPNATGDPYKTLFVARLNYETSENKIKREFEAYGPIKRVRLVTEKDTSKPRGYAFIEYMHTRDMKNAYKQADGRKVDNKRVLVDVERGRTVPNWRPRRLGGGLGSSRMGGAETDKKDSAREQQQGGRPRSEEPRRDDRRADRDREKSRERVRERDRDERARERSHDRTRDRDSREEKHHHRDRERTRDRERGKDREREHGRDRDRRDRDRDRDRGRDYERETDRARSHDRHRERGRDRGERDYERTSHERDRGHRHERDADYGNGGPKHDKNLSSYGQDYGYGQYEQHKGHEAYGYGQDGRGHETEHSKRHDQEYYRVDSYSKMETNYQVQPNNAEPEGPEEGEAYEEGDYQYHRAGEHMNDA
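
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and, to stay short, uses only the first 30 and last 18 nucleotides of the CDS copied from this record. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# Fragments copied from the CDS sequence in this record.
cds_start = "ATGGGCGACTACGGCCACGGCGGGGGCCAG"  # first 30 nt
cds_end = "CACATGAATGATGCTTGA"                # last 18 nt, including the stop

print(translate(cds_start))  # MGDYGHGGGQ
print(translate(cds_end))    # HMNDA*
```

Both fragments reproduce the corresponding ends of the Protein sequence, with the trailing `*` marking the TGA stop codon that is not part of the 491 aa product.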